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The relationship between Cartesian coordinates and Euclidean geometry is well known. The theorems 
from Euclidean geometry don’t mention anything about coordinates, but when you need to apply those 
theorems to a physical problem, you need to calculate lengths, angles, et cetera, or to do geometric proofs 
using analytic geometry. 

Homogeneous coordinates and projective geometry bear exactly the same relationship. Homogeneous co- 
ordinates provide a method for doing calculations and proving theorems in projective geometry, especially 
when it is used in practical applications. 


Although projective geometry is a perfectly good area of “pure mathematics’, it is also quite useful in 
certain real-world applications. The one with which the author is most familiar is in the area of computer 
graphics. Since it is almost always easier to understand mathematics when there are concrete examples 
available, we’ ll use computer graphics in this document as a source for almost all the examples. 


The prerequisites for the material contained herein include matrix algebra (how to multiply, add, and 
invert matrices, and how to multiply vectors by matrices to obtain other vectors), a bit of vector algebra, 
some trigonometry, and an understanding of Euclidean geometry. 


1 Computer Graphics Problems 


We’ II begin the study of homogeneous coordinates by describing a set of problems from three-dimensional 
computer graphics that at first seem to have unrelated solutions. We will then show that with certain 
“tricks”, all of them can be solved in the same way. Finally, we will show that this “same way” is in fact 
just a recasting of the original problems in terms of projective geometry. 


1.1 Overview 


Much of computer graphics concerns itself with the problem of displaying three-dimensional objects 
realistically on a two-dimensional screen. We would like to be able to rotate, translate, and scale our 
objects, to view them from arbitrary points of view, and finally, to be able to view them in perspective. 
We would like to be able to display our objects in coordinate systems that are convenient for us, and to be 
able to reuse object descriptions when necessary. 


As a canonical problem, let’s imagine that we want to draw a scene of a highway with a bunch of cars 
on it. To simplify the situation, we’ll have all the cars look the same, but they are in different locations, 
moving at different speeds, et cetera. 


We have the coordinates to describe a tire, for example, in a convenient form where the axis of the tire is 
aligned with the x-axis of our coordinate system, and the center of the tire is at (0,0,0). We would like 
to use the same description to draw all the tires on a car simply by translating them to the four locations 
on the body. Our car body, of course, is also defined in a nice coordinate system centered at (0,0, 0) and 
aligned with the z, y, and z-axes. 


Once we get the tires “attached” to the body, we’d like to make multiple copies of the car in different 
orientations on the road. The cars may be pointing in different directions, may be moving uphill and 
downhill, and the one that was involved in a crash may be lying upside-down. 


Perhaps we want to view the entire scene from the point of view of a traffic helicopter that can be anywhere 
above the highway in three-dimensional space and tilted at any angle. 


We’ll deal with the viewing in perspective later, but the first three problems to solve are how to translate, 
rotate, and scale the coordinates used to describe the objects in the scene. Of course we want to be able 


to perform combinations of those operations!. 


We assume that every object is described in terms of three-dimensional cartesian coordinates like (x, y, z), 
and we will not worry how the actual drawing takes place. (In other words, whether the coordinates are 
vertices of triangles, ends of lines, or control points for spline surfaces—it’s all the same to us—we just 
transform the coordinates and assume that the drawing will be dragged around with them.) 


Finally, except in the cases of rotation and perspective transformation, it is easier to visualize and ex- 
periment with two-dimensional drawings and the extension to three dimensions is obvious and straight- 
forward. 


1.2. Translation 


Translation is the simplest of the operations. If you have a set of points described in cartesian coordinates, 
and if you add the same amount to the z-coordinate of every one, all will move by the same amount in 
the x-direction, effectively moving the drawing by that amount. Adding a positive amount moves to the 
right; a negative amount to the left. 


Similarly, additions of a constant value to the y or z-coordinate cause uniform translations in those direc- 
tions as well. The translations are independent and can be performed in any order, including all at once. 
If an object is moved one unit to the right and one unit up, that’s the same as moving it one unit up and 
then one to the right. The net result is a motion of length /2 units to the upper-right. 


We can define a general translation operator T(x, y, z) as follows: 
Tee sty ste (x, YZ) ime (x ttz,yt ty, z+ tz), 


where f,, ¢,, and ¢, are the translation distances in the directions of the three coordinate axes. They may 
be positive, negative, or zero. 7;,,2,,t, 18 a function mapping points of three-dimensional space into itself. 
This Tee sty te has an inverse, Ten ty ste —a a —te,—t,,-t, Which simply translates in the opposite directions 
along each coordinate axis. Clearly: 


Tee sty te Cee aes (z,y,z)) = 
(@-—tg tte, y—ty + ty, z— tz + tz) — (x,y, 2). 


z 


1.3. Rotation about an Axis 


We’ll begin by considering a rotation in the z-y plane about the origin by an angle @ in the counter- 
clockwise direction. Clearly, all of the z-coordinates will remain the same after the rotation. We will 
denote this rotation by ®, 4. The z subscript is because the rotation is, in fact, a rotation about the z-axis. 


Figure 1 shows how to obtain the equation for the rotation. We begin with a point P = (2, y) and we 
wish to find the coordinates of P’ = (z’, y’) which result from rotating P by an angle @ counter-clockwise 
about the z-axis. The equations are most easily obtained by using polar coordinates, where r = /a? + y? 
is the distance from the origin to P (and to P’) , and ¢ is the angle the line connecting the origin to P 
makes with the z-axis. 


As we can see in the figure, x = rcos ¢ and y = rsin @, while 2’ = rcos(@+ 6) andy’ = rsin(¢ + 6). 


'There is one other operation that can easily be performed called “shearing”, but it is not particularly useful. The general 
homogeneous transformations that we’ll discover also handle all the shearing operations seamlessly. 


—(}—~ 


Uo | oN P= (x, y’) = (r cos (+8), r sin (9+8)) 
Va ae 


/\ 
/ /e \P = (x, y) = (rcos 6, r sin 6) 


Figure 1: Rotation about the z-axis 


Using the addition formulas for sine and cosine, we obtain: 


(z’,y') = (rcos(¢+9),rsin(é+9)) 
= (rcos¢cosé —rsindsin#,rcos¢sin#d +rsin¢cosé) 
= (gcosé — ysind@,zsiné + ycosé@). 


Thus we obtain: 
Rz,6(£,y,z) = (xcosé — ysin#, xsin# + ycos8, z). (1) 


As was the case with translation, rotation in the clockwise direction is the inverse of rotation in the 
counter-clockwise direction and vice versa: Rep = R,,-6. It’s a good exercise to check this by applying 
equation | and its inverse to a point (2, y). 


The equations for rotation about the x and y axes can be obtained similarly, and for reference, here are all 
three equations together: 


Raoa(t,y,z) = (#,ycosé —zsind,ysin# + zcos6) (2) 
Ryo(f,y,2) = (xcosd+zsind, y, —rsin@ + zcos8) (3) 
Rz(L,y,2) = (xcosd—ysind,rsind + ycos@,z). (4) 


At first glance, it appears that we have made an error in the signs in equation 3 for the rotation about 
the y-axis, since they are reversed from those for rotations about the x and z-axes. But all are correct. 
The apparent problem has to do with the fact that the standard three-dimensional coordinate system is 
right-handed—if the x and y-axes are drawn as usual on a piece of paper, we must decide whether the 
positive z-axis is above or below the paper. We have chosen to place positive z values above the paper. 


A left-handed system, where the positive z-axis goes down, is perfectly reasonable, but the usual conven- 
tion is the other way, and the difference is that the signs in some operations are switched around. The 
normal orientation is called a “right-handed” coordinate system and the other, “left-handed”. Even if we 
had used a left-handed system, the signs would not be the same throughout the equations 2-4; a different 
set of signs would be flipped. 


Think of the orientation of a rotation as follows: to visualize rotation about an axis, put your eye on 
that axis in the positive direction and look toward the origin. Then a positive rotation corresponds to a 
counter-clockwise rotation. We’ve done this with rotation about the z-axis—your eye is above the paper 
looking down on a standard z-y coordinate system. But visualize the situation looking from the positive 
y and positive z directions in a right-handed coordinate system. Looking from the z direction, the y goes 
to the right and the z goes up, but looking from the positive y-axis, the z-axis goes to the left, while the 
Z-axis goes up. 


1.4 General Rotation 


What if you want to rotate about an axis that does not happen to be one of the three principal axes (the 
x, y, and z axes are called the “principal axes”)? What if you want to rotate about a point other than the 
origin? It turns out that both of these problems can be solved in terms of operations that we already know 
how to do. Let’s begin by looking at rotation about non-principal axes that do pass through the origin. 


The strategy is this: we will do one or two rotations about the principal axes to get the axis we want 
aligned with the z-axis. Then we’ll rotate about the z-axis, and finally, we’ll undo the rotations we did 
to align your axis with the z-axis. To do this, assume that the axis of rotation you want points along 
the vector (x,y, z). We’d like to have it along another vector with its y and z-coordinates zero. If the 
y-coordinate is non-zero, do a rotation about the z-axis to make the y-coordinate zero. If the z-coordinate 
is still non-zero, do a rotation about the y-axis to make the z-coordinate zero. Since the rotation is about 
the y-axis, the y-coordinate (which you previously rotated to be zero) will not be affected. Thus at most 
two rotations will align an arbitrary axis with the z-axis. 


So if the problem is to rotate about the origin by an angle @, but with an arbitrary axis, what we need to 
do is perform two rotations to do the alignment. For concreteness, assume those rotations are by ¢ about 
the x-axis and then by 7% about the y-axis. If P = (x,y, z) is any point in space, the new point P’ that 
results from a rotation of P about this oddball axis is: 


P= Rz,—o(Ry,—v (Rao (Ry,v(Ra,o(P))))). (5) 


To interpret equation 5 remember that the operations are performed from the innermost parentheses out- 
ward. First, rotate P about the x-axis by an angle @. Rotate the resulting point about the y-axis by an 
angle 7. At this point, the oddball axis is aligned with the z-axis, so the rotation you wanted to do origi- 
nally can now be done with the Rg operator. Finally, the two outermost operations return the axis to its 
original orientation. 


Obviously, combining five levels of calculations from equations 2-4 will result in a nightmarish system of 
equations, but something that is not difficult for a computer to deal with. 


Finally, what if the rotation is not about the origin? This time the translation operations from Section 1.2 
come to the rescue together with a similar strategy to what we used above for a non-standard axis. We 
simply need to translate the center of rotation to the origin, perform the rotation, and translate back. If we 
denote by ® any sort of rotation about any axis through the origin (possibly constructed as a composition 
of five standard rotations as illustrated in equation 5 above), and we wish to perform that rotation about 
the point (a, y, z), here is the equation that relates an arbitrary point P to its position P’ after rotation: 


Pl= Ta,y,2(R(T—2,-y,-z(P))). 


1.5 Scaling (Dilatation) and Reflection 


What if we want to make things larger or smaller? For example, if we have the coordinates that describe 
an automobile, what are the coordinates that would describe a scale model of an automobile that is 10 
times smaller than the original? 


It’s fairly clear that if we multiply all of our coordinates by 1/10 we will get a model that’s 1/10 the 
size, but notice that if the original coordinates had described a car a mile from the origin, the resulting 
miniature car would be only about 1/10 mile from the origin. Thus our strategy does scale down the size 
of the car, but the scaling occurs about the origin. If we wanted to scale about the original car a mile from 
the origin, we could use the same trick we did with rotations—translate the car to the origin, multiply all 
the coordinates by 1/10, and finally translate the resulting coordinates back to the original position. 


Non-uniform scaling is also easy to do—if we wish to make an object twice as large in the z-direction, 
three times as large in the y-direction, and to leave the z size unchanged, we simply multiply all the 
z-coordinates by 2, all the y-coordinates by 3, and leave the z-coordinates unchanged (or equivalently, 
multiply them all by 1). 


Thus, the most general scaling operation about the origin is given in terms of the scale factors sz, sy, and 
Sz, and the formula for such a function is: 


5268, (8 (x,y, 2) = (Se, SyY, 822). (6) 


In equation 6, if the scale values are larger than 1, the object’s size increases; if they are less than 1, it 
decreases, and if they are equal to one, the size is unchanged. 


Negative scale values correspond to a combination of a size change and a reflection across the plane 
perpendicular to the axis in question passing through the origin. If s, = —1, there is no size change, but 
the object is reflected through the plane x = 0. 


The same ideas used previously can be used to produce scaling functions in directions not aligned with 
the principal axes. If scaling is to occur about a non-principal axis, rotate that axis to be the z-axis, scale 
in the x-direction, and then rotate back. 


We’ve considered positive and negative scalings, but what happens if, say, s, = 0? All z-values are 
collapsed to zero, and it’s as if the entire three-dimensional space is projected to the plane z = 0. We 
are going to need this operation (or something similar) when we draw our three-dimensional space on a 
two-dimensional computer screen. 


As long as all three values sz, sy, and sz are non-zero, the scale function has an inverse. It should be 
obvious that Oye wiiibs = S1/s,,1/sy,1/s.> and the inverse only makes sense if all three scale values are 
non-zero. From a physical point of view it’s easy to see why. If the scaling operation is really a projection 
to a plane, there is no way to undo it. Any point on the plane could have come from any of the points on 


the line through space that projected to that point. 


2 Combining Rotation, Translation, and Scaling 


As we have seen above, it is often advantageous to combine the various transformations to form a more 
complex transformation that does exactly what we want. If we simply do the algebra, things can get 
complicated in a hurry. To illustrate the problem, consider a relatively simple problem—we’d like to 
rotate clockwise by an angle 6 about an axis parallel to the z-axis but passing through the point (x1, y1, 0). 


The combined transformation of the point (z, y, z) to (a’, y’, 2’) is this: 


(2',y’, 2’) = Tex ,y1,0(Ra,6(T—21,-y1,0(2, Y 2))) 
= ((#%—21)cosé — (y—y1)sin@+ x, 
(x — 21) sin@ + (y—y1) cos + 1,2), 


and this one is pretty straight-forward. Imagine what the combination of 5 rotations would look like. 


But there is an easy method, and we’ll begin by looking at how to combine rotations. It turns out that all 
of the rotations about the origin can be easily expressed in terms of matrix multiplication. If we consider 
our points P = (2, y, z) to be three-dimensional column vectors”, every rotation corresponds exactly to a 
multiplication by a certain matrix. Here is the matrix that corresponds to a counter-clockwise rotation by 
an angle @ about the z axis: 


x cos@é —sin@ 0 x xcos@ — ysind 
Rzoely|= |sin@ cosé@ 0 y | = | esin@+ycosé 
z 0 0 1 z z 


The other two matrices are equally simple: 


2For technical reasons, it is better to represent the vectors as column vectors. We could use row vectors, but there are disadvan- 
tages that are difficult to explain at this point. However, to save space in the text, we will write the components as usual: (x, y, z) 
within paragraphs with the understanding that when they are expressed as vectors in equations, they will be turned vertical to make 
column vectors. When we talk about projective lines later on, we’ll need to use actual row vectors, and to indicate this within a 
paragraph, we’ll use the somewhat surprising (x, y, z)7. 


£ cos@ 0. siné £ xcosé + zsind 
Ryel{yJ=[ 0 1 0 y| = y : 
Zz —sin@ 0 cosé Zz —xsin@ + zcosé 
z 1 0 0 z £ 
Roa |y|= [0 cos@ —siné y | = | ycos@ — zsin#@ 
z 0 sin@ cosé z ysin@ + zcos@ 


The beautiful thing about the matrix representation is that repeated rotations about different axes corre- 
sponds to matrix multiplication. Thus if you need to rotate a million vertices that describe the skin of a 
dinosaur in Jurassic Park VI about some weird axis, you don’t need to multiply each point by five different 
matrices; you simply multiply the five matrices together once and multiply each dinosaur point by that 
one matrix. It’s a savings of almost four million matrix multiplications which can take time, even on a 
fast computer. 


The scaling matrix is even simpler: 


x Se O 0 x Sek 
Ss.,sy,8.|Y|=|90 sy O Yl = | Syy 
z 0 O s, Z 82% 


It can be combined in any combination with the rotation matrices above to make still more complex 
transformations. As with the rotation matrices alone, the combination of operations simply corresponds 
to matrix multiplication. 


Unfortunately, when we try to do the same thing with the seemingly simpler translation operation, we are 
dead. It just will not and cannot work this way. It’s easy to see why. If you multiply the column vector 
(0,0, 0) by any 3 x 3 matrix, the result will be (0,0, 0). The origin is fixed by every matrix multiplication, 
yet for a translation, we require that the origin move. 


Fortunately, there is a trick? to get the job done. We will simply add an artificial fourth component to each 
vector and we will always set it to be 1. In other words, the point we used to refer to as (#, y, z), we will 
now refer to as (x,y, z, 1). If you need to find the actual three-dimensional coordinates, simply look at 
the first three components and ignore the 1 in the fourth position’. 


Of course none of the matrices above will work either, until we add a fourth row and fourth column with 
all the elements equal to zero except for the bottom corner. For example: 


x cos@é -—sin@é 0 0 x xcos 6 — ysind 

R y sin@ cos@ 0O O y| _ | vsiné+ycosé 
al We 0 0 1 Of fz]— z 
1 0 0 0 1 1 1 


But now, with this artificial fourth coordinate, it is possible to represent an arbitrary translation as a matrix 
multiplication: 


x 100k x B+ ty 

y| |O 104, y| |ytt, 
Teestute | 2) =10 0 1 t.||2z| |z+t 

1 00 0 1 1 


Using this scheme, every rotation, translation, and scaling operation can be represented by a matrix mul- 
tiplication, and any combination of the operations above corresponds to the products of the corresponding 
matrices. 


3 Computer scientists, of course, refer to a “trick” as a “hack”. 
4But if it is not 1, beware. We’ll handle this case later, when we run into the problem head-on. For now, things are nice. 


3 A Simple Perspective Transformation 


We know that by setting one of the scale factors to zero, we can collapse all of the z-coordinates, say, to 
z = 0. If we think of our computer screen as having x and y-coordinates in the usual way, we want to do 
something like this to find the screen coordinates for our points. 


But setting the z scale factor to zero simply projects each point in space to the x-y plane in a perpendicular 
direction. To model the real world, we’d like to imagine looking at real three-dimensional objects, and 
projecting them, wherever they are, to a rectangular piece of glass (the computer screen) that is a few 
inches in front of our eye. These rays are not projected perpendicular to the screen; they are all projected 
at the eye, and they should be drawn on the screen wherever the ray from the object to the eye hits the 
screen. 


_P = (x, y Zz) 
IP? =(7z yh 1) 
I 


Figure 2: A Simple Perspective Projection 


For definiteness, imagine that you are looking from the origin in the direction of the negative z-axis. 
Remember that if you look from the positive z-axis toward the negative, the z-y plane looks normal to 
you, with the positive x-axis to the right and the positive y-axis pointing up. Imagine that we would like 
to project all the points in front of you (they will be the points with negative z-coordinates) onto the plane 
one unit in front of you (it will have z-coordinate equal to —1). Most computer graphics folks don’t like 
working with these negative coordinates, so they now switch to a left-handed coordinate system so that 
the point z = I is in front of them, looking into the plane. We’ll do that here, just so we don’t need to 
mess with —1 as the z-coordinate of projection. 


Figure 2 shows how we would like to project an arbitrary point P = (x, y, z) to a point P’ on the plane. 
Similar triangles will show you that the coordinates of P’ are (x/z, y/z, 1). 


This is bad news—we’ ve got to do a division by the z-coordinate, and the only operations we can perform 
with matrix multiplications are multiplications and additions. How can we combine this perspective 
transform with the rotations, translations, and scales that work so well with a uniform type of matrix 
representation? 


But with one more “trick” (one more “hack”, if you’re a computer scientist), we can perform the division 
as well. It seems a shame that we have to drag around that final fourth coordinate when it’s always going 
to be equal to 1, but without it the matrix multiplications don’t make sense. Here is the big trick: We will 
consider two sets of coordinates to represent the same point if one is a non-zero multiple of the other. In 
other words, the point (x, y, z, w) represents the same point as does (az, ay, az,aw). 


Normally, of course, w = 1, but this shows how we can convert to our standard form if we get a point 
whose fourth coordinate does not happen to be 1. As long as it is non-zero, we just let a = 1/w, and we 
find that the points (z, y, z, w) and (a/w, y/w, z/w, 1) are equivalent. 


This may seem weird at first, but you should not feel at all uncomfortable with it. After all, you do it all 
the time with fractions. Everybody knows that the fraction 1/3 is normally written that way, but certain 
calculations give results like 2/6, 3/9, or even 17/51. All are equivalent to 1/3. 


When we represent our three-dimensional points with four coordinates, it’s sort of like having a funny 
sort of fraction where the three numerators share a common denominator. And exactly as is the case with 


fractions, as long as the “denominator” (the fourth term) is not equal to zero, we'll have no trouble. 


Here is the matrix that performs the perspective calculation shown in Figure 2 that takes the three- 
dimensional point (2%, y, z) to (#/z,y/z, 1): 


cooor 
core 
eFPreoeqc 
ooo oO 
Ere sy 
NRreOeE 8 


The resulting vector after the multiplication is (x, y, z,z), but remember that we can multiply all the 
components by the same non-zero number and get an equivalent point, and since we’re looking at values 
of z in the half-space with negative z-coordinates, we can multiply all of the coordinates by 1/z to obtain 
the equivalent representation: (#/z,y/z,1, 1). 


There is one problem with this matrix: it is singular, meaning that it does not have an inverse. The reason 
is that our calculation effectively projected every point on the line connecting P and P’ to the same point 
on the plane z = 1. Any function or transformation that maps two points to the same point cannot be 
undone, or inverted. 


At first, this seems like a purely aesthetic problem. After all, aren’t we planning to map all the points 
along that line to the same point on the computer screen? The problem is that in computer graphics, you 
often want to find where to draw them on the screen but to avoid drawing them until you’ve found all 
the points to be drawn there, and then to draw only the point that’s nearest the eye of the viewer. That’s 
because in the real world, objects nearer your eye block your vision of objects behind them. 


We can solve the problem easily, and at the same time, create a non-singular (invertable) matrix. We don’t 
really care that the z coordinate is 1 after transformation—all we care about are the x and y coordinates 
that tell us where to paint the point on the screen. Here’s a better perspective matrix: 


10 0 0 Lz £ 
0 1 0 0 y| _ y 
0011 z z+1]- @) 
001 0 1 A 


This transforms our point (x, y, z, 1) to (#/z,y/z,1+1/z, 1) (at least after we multiply all the coordinates 
by 1/z). The final x and y coordinates are the same as before—everything on that line from P to P’ goes 
to points with the same z and y coordinates—but the z values are all different. The order of the points is 
inverted, but at least it’s easy to see from the transformed coordinates which ones were closer to the eye?. 


The matrix in 7 is non-singular; its inverse is: 


-1 


100 0 100 0 
0 1 0 0 10 1 0 =O 
0011 ~10 00 1 
001 0 001 -1 


Note that since we are dealing with transformations of three-dimensional space, all the transformation 
matrices are 4 x 4. If we were working only with points on the plane (two-dimensional space), the 
transformation matrices would have been 3 x 3. For a line, they would have been 2 x 2, et cetera. 


5In standard computer graphics packages, a more sophisticated version of the perspective matrix is generally used to control the 
various aspect ratios and to control the range of z values that emerge after the calculation. If you’re interested, look at a book on 
computer graphics for the exact forms, but the basic idea is identical to that illustrated in transformation 7. 


4 Homogeneous Coordinates 


Homogeneous coordinates provide a method to perform certain standard operations on points in Euclidean 
space by means of matrix multiplications. As we shall see, they provide a great deal more, but let’s first 
review what we know up to this point. 


The usual cartesian coordinates for a point consist of a list of m points, where n is the dimension of the 
space. The homogeneous coordinates corresponding to the same point require n + 1 coordinates. 


Normally, we add a coordinate to the end of the list, and make it equal to 1. Thus the two-dimensional 
point (x, y) becomes (x, y, 1) in homogeneous coordinates, and the three-dimensional point (x, y, z) be- 
comes (x,y, 2, 1). To learn more, it is often useful to look at the one-dimensional space (points on a line), 
and it is also useful to remember that the same method can be applied to higher-dimensional Euclidean 
spaces. Homogeneous coordinates in a seven-dimensional Euclidean space have eight coordinates. 


The final coordinate need not be 1. Since the most common use of homogeneous coordinates is for one, 
two, and three-dimensional Euclidean spaces, the final coordinate is often called “w’’ since that will not 
interfere with the usual z, y, and z-coordinates. In fact, two points are equivalent if one is a non-zero 
constant multiple of the other. Points corresponding to standard Euclidean points all have non-zero values 
in the final (w) coordinate. 


5 Projective Geometry 


What’s really going on is, in a sense, far simpler. Homogeneous coordinates are not Euclidean coordi- 
nates; they are the natural coordinates of a different type of geometry called projective geometry. 


Here is the real definition of homogeneous coordinates in projective geometry, where we will consider 
the two-dimensional version (with three coordinates) for concreteness. 


Every vector of three real numbers, (x,y, w), where at least one of the numbers is non-zero, corresponds 
to a point in two-dimensional projective geometry. The coordinates for a point are not unique; if @ is any 
non-zero real number, then the coordinates (x,y, w) and (ax, ay, aw) correspond to exactly the same 
point. 


If the w-coordinate is non-zero, it will correspond to a Euclidean point, but if w = 0 (and at least one of 
x or y is non-zero), it will correspond to a “point at infinity” (see Section 7). 


Furthermore, the allowable transformations in (two-dimensional) projective geometry correspond to mul- 
tiplication by arbitrary non-singular 3 x 3 matrices. Obviously, if two matrices are related by the fact that 
one is a constant non-zero multiple of the other, they represent the same transformation. 


The coordinates are called “homogeneous” since they look the same all over the space, and with the 
complete flexibility of multiplication by an arbitrary non-singular matrix we can convert any line to be 
the line at infinity, or convert points at infinity to points in normal space, et cetera. In fact, if you ever 
took a perspective drawing class, the “vanishing points” on the horizon are really places, where, under a 
perspective transformation, points at infinity wind up in normal space. 


There is more to projective geometry, of course. There are equations for lines, for conic sections, methods 
to find intersections of lines or for finding the lines that pass thorough a pair of points, et cetera, but we 
will get to those later. Let us begin by describing a nice mental model for two-dimensional projective 
geometry. 


6 Euclidean and Projective Geometry 


Projective geometry is not the same as Euclidean geometry, but it is closely related. The two have many 
things in common. Just as we can discuss Euclidean geometry in any finite number of dimensions, we 
can do the same for projective geometry. Of course real-world applications are typically two and three- 


dimensional in both geometries, but we’ll sometimes find it useful to think about the one-dimensional 
version of both. 


A nice way to think about various types of geometry is in terms of the allowable operations and the 
properties that are preserved under those operations. For example, in Euclidean geometry, we can move 
figures around on the plane, rotate them, or flip them over, and if we do these things, the resulting trans- 
formed figures remain congruent to the originals. Two figures are congruent if all the measurements are 
the same—lengths of sides, angles, et cetera. 


In projective geometry, we are going to allow projection as the fundamental operation. It’s easy to see 
what projection means in one and two dimensions, so we’ll begin with those. 


Suppose you have a figure drawn on a plane. You can project it to another plane as follows: pick some 
point of projection that is on neither of the planes. Draw straight lines through every point of the original 
figure that pass through the point of projection. The image of each point is the intersection of that line 
with the other plane. 


Note also that we can obtain projections perpendicular to the plane of projection simply by projecting 
from a “point at infinity’”—see Section 7. 


Notice that we have said nothing about the orientations of the two planes—they need not be parallel, for 
example. In your mind’s eye, try to imagine some of these two-dimensional projections. 


7 Visualizing Projective Geometry 


Here are two postulates from two-dimensional Euclidean geometry: 


e Every two points lie on a line. 


e Every two lines lie on a point, unless the lines are parallel, in which case, they don’t. 


In two-dimensional projective geometry, these postulates are replaced by: 


e Every two points lie on a line. 


e Every two lines lie on a point. 


That’s basically the whole difference. How can we visualize a model for such a thing? The model must 
describe all the points, all the lines, what points are on what lines, and so on. 


The easiest way is to take the points and lines from a standard two-dimensional Euclidean plane and add 
stuff until the projective postulates are satisfied. The first problem is that the parallel lines don’t meet. 
Lines that are almost parallel meet way out in the direction of the lines, so for parallel lines, add a single 
point for each possible direction and add it to all the parallel lines going that way. You can think of these 
points as being points at infinity—at the “ends” of the lines. Note that each line includes a single point at 
infinity—the north-south line doesn’t have both a north and south point at infinity. If you “go to infinity” 
to the north and keep going, you will find yourself looping around from the south. Lines in projective 
geometry form loops. 


Now take all the new points at infinity and add a single line at infinity going through all of them. It, too, 
forms a loop that can be imagined to wrap around the whole original Euclidean plane. These points and 
lines make up the projective plane. 


You might make a mental picture as follows. For some small configuration of points and lines that you 
are considering, imagine a really large circle centered around them, so large that the part of the figure of 
interest is like a dot in its center. Now any parallel lines that go through that “dot” will hit the large circle 
very close together, at a point that depends only on their direction. Just imagine that all parallel lines hit 
the circle at that point. This large circular line surrounding everything is the “line at infinity”. 
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Check the postulates. Two points in the Euclidean plane still determine a single projective line. One point 
in the plane and a point at infinity determine the projective line through the point and going in the given 
direction. Finally, the line at infinity passes through any two points at infinity. 


How about lines? Two non-parallel lines in the Euclidean plane still meet in a point (the standard Eu- 
clidean point), and don’t meet anywhere else. Parallel lines have the same direction, so meet at the point 
at infinity in that direction. Every line on the original plane meets the line at infinity at the point at infinity 
corresponding to the line’s direction. 


Note: The projective postulates do not distinguish between points and lines in the sense that if you saw 
them written in a foreign language: 


e Every two glorphs lie on a smynx, 


e Every two smynxes lie on a glorph, 


there is no way to figure out whether a smynx is a line and a glorph is a point or vice-versa. If you take any 
theorem in two-dimensional projective geometry and replace “point” with “line” and “line” with “point”, 
it makes a new theorem that is also true. This is called “duality”—-see any text on projective geometry. 


8 Back to the Homogeneous Coordinates 


So we’ve got a nice mental picture—how do we assign coordinates and calculate with them? The answer 
is that every triple of real numbers (x, y, w) except (0, 0,0) corresponds to a projective point. If w is non- 
zero, (x,y, w) corresponds to the Euclidean point (#/w,y/w) in the original Euclidean plane; (2, y, 0) 
corresponds to the point at infinity corresponding to the direction of the line passing through (0,0) and 
(x, y). Generally, if @ is any non-zero number, the homogeneous coordinates (x, y, w) and (az, ay, aw) 
represent the same point. 


Since projective points and lines are in some sense indistinguishable, it had better be possible to give 
line coordinates as sets of three numbers (with at least one non-zero). If the points are column vectors, 
the lines will be row vectors® (written with a “T” exponent that represents “transpose”), so (a, b,c)" 
represents a line. The point P = (x,y, w) lies on the line L = (a, b,c)" if ax + by + cw = 0. In the 
Euclidean plane, the point (x,y) can be written in projective coordinates as (x,y, 1), so the condition 
becomes az + by + c = 0—high-school algebra’s equation for a line. The line passing through all the 
points at infinity has coordinates (0,0,1)?. As with points, for any non-zero a, the line coordinates 
(a,b,c)? and (aa, ab, ac)" represent the same line. Also, as with points, at least one of a, b, or ¢ must 
be non-zero to have a set of valid line coordinates. 


In matrix notation, the point P lies on the line ZL if and only if LP = 0. This is like the dot product of the 
vectors. Since Lis a row vector and P is a column vector of the same length, the product is essentially a 
1 x 1 matrix, or basically, a scalar. If P = (2, y, w) and L = (a,b,c)", then LP = ax + by + cw. If we 
had chosen to represent lines as column vectors and points as row vectors, that would work fine, too. It 
has to work because points and lines are dual concepts. 


9 Projective Transformations 


Projective transformations transform (projective) points to points and (projective) lines to lines such that 
incidence is preserved. In other words, if 7 is a projective transformation and points P and Q lie on line 
LE then 7(P) and 7(Q) lie on 7 (L). (Warning: 7(£)—the transformation of a line—does not simply 
use the same matrix as for transforming points. See later in this section.) Similarly, if lines LZ and M meet 
at point P, then the lines 7(£) and 7(M) meet at the point 7 (P). 


©Remember that we are writing column vectors in the text as rows, so we’re going to have to have a special notation to indicate 
that a vector in the text is really a row vector. That’s what the transpose will be used for. 
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The reason projective transformations are so interesting is that if we use the model of the projective 
plane described in Section 7 where we’ve simply added some stuff to the Euclidean plane, the projective 
transformations restricted to the Euclidean plane include all rotations, translations, non-zero scales, and 
shearing operations. This would be powerful enough, but if we don’t restrict the transformations to the 
Euclidean plane, the projective transformations also include the standard projections, including the very 
important perspective projection. 


Rotation, translation, scaling, shearing (and all combinations of them) map the line at infinity to itself, 
although the points on that line may be mapped to other points at infinity. For example, a rotation of 5 
degrees maps each point at infinity corresponding to a direction to the point corresponding to the direction 
rotated 5 degrees. Pure translations preserve the directions, so a translation maps each point at infinity to 
itself. 


The standard perspective transformation (with a 90° field of view, the eye at the origin, and looking down 
the y-axis) maps the origin to the point at infinity in the y-direction. The viewing trapezoid maps to a 
square. 


Every non-singular 3 x 3 matrix (non-singular means that the matrix has an inverse) represents a projective 
transformation, and every projective transformation is represented by a non-singular 3 x 3 matrix. If M 
is such a transformation matrix and P is a projective point, then M P is the transformed point. If E is a 
line, LM" represents the transformed line. It’s easy to see why this works: if P lies on L, LP = 0, so 
ILM1MP = 0, so (ELM~-')(MP) = 0. The matrix representation is not unique—as with points and 
lines, any constant multiple of a matrix represents the same projective transformation. 


Combinations of transformations are represented by products of matrices; a rotation represented by matrix 
R followed by a translation (matrix 7) is represented by the matrix 7‘. 


A (two-dimensional) projective transformation is completely determined if you know the images of 4 
independent points (or of 4 independent lines). This is easy to see. A 3 x 3 matrix has nine numbers 
in it, but since any constant multiple represents the same transformation, there are basically 8 degrees of 
freedom. Each point transformation that you lock down eliminates 2 degrees of freedom, so the images 
of 4 points completely determine the transformation. 


Let’s look at a simple example of how this can be used by deriving from scratch the rotation matrix for a 
45° counter-clockwise rotation about the origin. The origin maps to itself, the points at infinity along the 
gx and y axes map to points at infinity rotated 45°, and the point (1, 1) maps to (0, V2). 


If R is the unknown matrix: 


0 0 1 V2 
R10 = k {0 R10 = k| V2 
1 1 0 0 
0 -—Vv2 1 0 
Ri[1|] = ks | v2 Ril] = ka | V2 
0 0 1 1 
The k,...,%4 can be any constants since any multiple of a projective point’s coordinates represents the 


same projective point. The matrix R has basically 8 unknowns, so those 8 plus the 4 &;’s make 12. Each 
matrix equation represents 3 equations, so we have a system of 12 equations and 12 unknowns that can 
be solved. The computations may be ugly, but it’s a straight-forward brute-force solution that gives the 
rotation matrix ® as any multiple of: 


V2/2 —-/2/2 0 
R= |V2/2 V2/2 0 
0 0 1 


There’s nothing special about rotation. Every projective transformation matrix can be determined in the 
same brute-force manner starting from the images of 4 independent points. 
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10 Three-Dimensional Projective Space 


Three-dimensional projective space has a similar model. Take three-dimensional Euclidean space, add 
points at infinity in every three-dimensional direction, and add a plane at infinity going through the points. 
In this case there will also be an infinite number of lines at infinity as well. In three dimensions, points 
and planes are dual objects. 


Projective transformations in three dimensions are exactly analogous. Points are represented by 4-tuple 
column vectors: (x,y,z, w), and planes by row vectors: (a,b,e,d)". Any multiple of a point’s coordi- 
nates represents the same projective point. A point P lies ona plane M if MP = 0. All three dimensional 
projective transformations are represented by 4 x 4 non-singular matrices. 


In three dimensions, the images of 5 independent points (or planes) completely determine a projective 
transformation. (A 4 x 4 matrix has 16 numbers, but 15 degrees of freedom because any multiple rep- 
resents the same transformation. Each point transformation that you nail down eliminates 3 degrees of 
freedom, so the images of 5 independent points completely determine the transformation.)’ 


The brute-force solution has 20 equations and 20 unknowns (there will be 5 &,’s in addition to the 15 
unknowns), and although the solution is time-consuming, it is straight-forward. 


The calculation can be simplified. Suppose you want a transformation that takes P, to Q1, ..., and Ps to 
Qs. Let 

q (1,0, 0,0) 

I, = (0,1,0,0) 

Iz = (0,0,1,0) 

Ig, = (0,0,0,1) 

Is (1,1,1,1) 


Find the transformation P that takes P; to J; and the transformation Q that takes Q; to I;. Because of all 
the zeroes, these are much easier to work out. The transformation you want is O-1Pp. 


10.1. Construction of an Arbitrary Transformation 


Based upon the idea above, here is a purely mechanical method to construct a transformation from any 
four independent points to any other four points in two-dimensional projective geometry. The method can 
obviously be extended to any number of dimensions, where the images of n + 2 points are required to 
determine the transformation. 


Suppose we seek a matrix M that performs the following map: 


M:(21,91,0¥1) 7 (X%1,%1,W) 
M : (%2,y2,W2) —- (Xe, Yo,We) 
M : (%3,y3,ws) + (X3,¥3,Ws) 
M : (a4,y4,wa) > (X4,Y4,W2) 


We will construct the matrix M as the product OP! where: 


The concept generalizes to n-dimensional space. Transformations are denoted by (n+ 1) x (n + 1) matrices having (n + 1)? 
entries, but an arbitrary constant multiple reduces this to (nm + 1)? —-l=n?4+2n= n(n + 2) degrees of freedom. Each time 
you nail down the image of an n-dimensional point, you remove n degrees of freedom, so the images of n + 2 independent points 
are required to completely determine a projective transformation in projective n-space. 
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P:(1,0,0) - (#1,y1,w1) Q0:(1,0,0) — (%4,%,W) 
P:(0,1,0) - (a2, yo, we) Bohs Q:(0,1,0) — (Xe,¥o, We) 
P:(0,0,1) - (a3, ys, ws) Q:(0,0,1) - (X3,Y3, Ws) 
P:(1,1,1) > (44, ys, wa) Q9:(,1,2) 3 (X4,¥1, Wa) 


The construction of P and Q is obviously identical, so we will show the construction of P only. 


We will denote the unknown entries in the matrix P by p,;. As usual, we will also use kj as the arbitrary 
constants in that multiply the homogeneous coordinates of our result. The matrix ? must satisfy: 


Pil Pi2 Ps 1001 kia kot. kgxg kaag 
p21 p22 p23 0 1 0 1J=] ky keys ksy3 kaya | . (8) 
P31 (P3233 0011 kw, kowe kgw3 kaw 


There are thirteen variables including the nine p,; and the four k;, of which we can fix any one. We choose 
to let ks = 1. We can actually perform the matrix multiplication on the left of equation 8 and set k4 = 1 
to obtain: 


Pi Pi2 Ps Pi t+Pi2t Pis kia, kot. kgxg 24 
por Po. Pe3 Po +pot+pes | — | kiy. keyo ksyg ya | - (9) 
P31 P32 P33 Psi + P32 + P33 kjw, kowa kgw3 wa 


From equation 9, we can immediately conclude that pj = kj;2j, po; = kjyj, and that pg; = kjw ;. We 
don’t yet know the values of kj except that k, = 1, but we can now rewrite equation 9 as: 


kya ko Xe k3x3 ky 21 + kore + k323 ky x1 kox2 k3x3 LA 
ky eyo keyg ky + koye + kgys | = | kiy. keys kgy3 ya | - (10) 
ky wy kowe k3w3 kyu. + kowe + k3ws kywy kowe k3ws W4 


The first three columns of equation 10 don’t help at all, but we can re-write the fourth column as follows: 


1 fo £3 ky v4 
Yi Yo Ys ko|=|[ya].- (11) 
Wi We Ws kg ZA 


ky X%, £2 2&3 LA 
ko|=|[% Yo Ys Ya ]- (12) 
kg WwW, Wo ws ZA 


Using the values of the k; obtained from equation 12 and substituting those into the first three columns of 
equation 9 we can find the unknown matrix P: 


Pil Pi2 Pi3 kya kote kgag 
P=|p2 poe pos|l= | ky keye kays 
P31 P32 p33 kw, kowe kgws 
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